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CHAPTER  1 


INTRODUCTION 

In  the  past  two  decades,  research  on  individual  differences  in 
aptitude  in  relation  to  instructional  variables  has  increased  substan¬ 
tially  (Cronbach,  1957;  Cronbach  and  Snow,  1977),  A  large  body  of 
literature  now  demonstrates  that  relations  between  learner  aptitudes 
and  learning  outcomes  vary  with  instructional  conditions*  Thus,  aptitude- 
treatment  interactions  (ATI)  clearly  exist.  However,  ATI  findings  have 
often  been  inconsistent  and  not  readily  applicable  to  real  world  instruc¬ 
tional  practice  (Snow,  1977).  Hence,  there  is  a  need  for  research  that 
can  provide  a  deeper  understanding  of  aptitudes,  instructional  treat¬ 
ments,  and  their  interactions. 

Background 

The  concept  of  general  mental  ability  is  central  to  most  current 
models  of  intelligence  and  ability  organization,  and  has  been  regarded 
as  the  most  important  aptitude  for  learning  from  instruction.  Measures 
of  general  ability  also  seem  to  show  the  strongest  and  most  consistent 
ATI.  This  is,  then,  the  logical  place  for  new  ATI  research  to  start. 

The  bulk  of  prior  evidence  is  consistent  with  the  following  hypo¬ 
thesis:  When  instruction  places  heavy  information  processing  burdens  on 
learners,  the  regression  of  learning  outcomes  onto  general  ability  differ¬ 
ences  is  relatively  steep;  able  students  do  well  and  less  able  students 
do  poorly.  In  contrast,  when  an  instructional  treatment  is  designed  to 
relieve  some  of  the  information  processing  burdens  on  learners  by  simpli¬ 
fying,  structuring,  or  elaborating  the  learning  task,  the  regression  of 
outcome  on  general  ability  is  relatively  shallow;  less  able  students 
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benefit,  more  able  students  sometimes  do  less  well  than  they  might  have 
done  without  help  (Snow,  1976). 

A  second,  popular  ATI  hypothesis  has  arisen  from  the  contrast  of 
verbal  and  non-verbal  ability.  This  is  the  major  distinction  beneath 
general  ability  in  a  hierarchical  view  of  ability  organization.  The  non¬ 
verbal  side  has  usually  been  thought  of  as  spatial  (Butcher,  1968; 

Vernon,  1950)  or  figural  and  symbolic  (Guilford,  1967)  ability,  or  as 
fluid-analytic  intelligence  (Cattell,  1963;  1971).  Factor  analytic 
studies  often  separate  verbal-crystallized  ability  (Gc)  from  fluid- 
analytic  and  visualization  ability  (see,  e.g.,  Snow,  Marshalek,  Lohman, 
Yalow  and  Webb,  1977).  But  the  further  distinction  between  fluid  ability 
and  spatial  visualization  has  often  been  difficult  to  make  (Lohman,  1979a 
Snow,  in  press) .  In  the  present  discussion,  Gfv  is  used  to  denote  a 
combined  fluid-analytic-visualization  dimension. 

This  second,  differential,  hypothesis  suggests  that  matching  the 
medium  of  instruction  (verbal  vs.  figural,  symbolic,  or  spatial)  to  the 
relative  strength  and  weakness  of  Gfv  and  Gc  in  learners  should  be  bene¬ 
ficial,  and  isolated  studies  based  on  this  hypothesis  have  reported  ATI. 
But  two  different  and  opposing  kinds  of  matching  are  possible,  and  both 
have  been  found;  matching  to  the  learner’s  strength  might  be  beneficial 
in  some  ways,  but  matching  to  the  learner’s  weakness  might  have  compensa¬ 
tory  benefits.  In  one  typical  study,  for  example,  Peterson  and  Hancock 
(1974)  taught  students  the  mathematics  of  network  tracing  using  either 
verbal,  symbolic,  or  figural  materials.  Aptitude  measures  were  selected 
from  Guilford’s  (1967)  system  to  represent  these  three  content  areas. 
Posttests  were  administered  immediately  after  instruction,  and  again 
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after  one  and  five  weeks.  The  regression  on  verbal  ability  was  shallower 


in  the  verbal  treatment  than  in  the  figural  or  symbolic  treatments  at 
all  three  testings,  suggesting  that  low  verbal  students  were  compensated 
by  a  verbal  treatment.  The  regressions  of  outcome  on  figural  and  symbolic 
ability  were  shallower  when  aptitude  matched  the  instructional  condition 
only  on  the  immediate  posttest.  On  both  retention  measures,  regressions 
were  steepest  for  figural  and  symbolic  ability  in  the  figural  and  symbolic 
treatments,  suggesting  that  one  should  match  to  strengths. 

The  results  of  other  studies  investigating  relations  between  verbal 
and  spatial  ability  in  instruction  have  also  been  inconsistent.  Allison 
(1960)  provided  instruction  on  concept  attainment  tasks  using  either 
verbal  stimuli  and  semantic  solution  rules  or  geometric  stimuli  and 
classification  solution  rules.  Verbal  and  spatial  aptitude  measures 
were  used.  Those  higher  in  verbal  ability  did  better  with  verbal  con¬ 
tent;  there  was  no  effect  for  spatial  ability.  Bracht  (1970)  taught 
addition  of  signed  numbers  using  figural  or  verbal  programmed  texts. 
Numerical,  verbal  and  spatial  aptitudes  were  measured.  There  were  no 
significant  ATI.  Markle  (1969)  taught  crystallography  using  programmed 
texts  composed  either  entirely  of  words  or  emphasizing  diagrams.  The 
pattern  of  correlations  of  outcome  with  verbal  and  spatial  measures  was 
similar  in  both  treatments.  In  a  series  of  studies  by  Carry  (1967), 

Webb  (1971),  and  Eastman  (1972),  students  were  taught  quadratic  inequali¬ 
ties  using  materials  designed  to  capitalize  on  spatial-visualization. 

After  reviewing  this  series,  Cronbach  and  Snow  (1977)  concluded,  "The 
three  studies  together  provide  only  negative  evidence  on  the  possible 
relevance  of  visualization  to  a  presentation  that  uses  graphs"  (p.285). 
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On  the  basis  of  these  and  other  mixed  results,  Cronbach  and  Snow 
(1977)  rejected  "the  conclusion  that  spatial  treatments  demand  spatial 
ability  and  that  differentiated  Guilford  abilities  will  interact  with 
treatments  of  the  same  name"  (p.  293).  They  did  not  rule  out  the  possi¬ 
bility  of  positive  findings  in  the  future,  however,  given  more  powerful 
and  penetrating  analyses. 

Gustafsson  (1974;  1976)  also  obtained  conflicting  results  in  a 
series  of  studies  exploring  the  verbal  vs.  figural  contrast.  Two  of 
these  studio s  used  a  text  on  polar  lights  as  a  verbal  treatment;  the 
pictorial  treatment  was  a  slightly  reduced  text  supplemented  with  illus¬ 
trations.  Aptitude  measures  included  a  vocabulary  test,  a  reasoning 
test,  and  a  spatial-visualization  test.  Learning  was  measured  by  a  short 
answer  test  and  an  essay  test.  Results  on  the  essay  test  were  not  con¬ 
sistent  across  the  two  studies.  On  the  short-answer  tests  in  both 
studies,  the  slope  of  the  regression  on  verbal  ability  was  steeper  in 
the  verbal  treatment  than  in  the  pictorial  treatment;  students  low  in 
Gc  did  best  with  pictures,  especially  if  they  were  also  high  on  Gv. 
Students  in  the  third  study  were  taught  about  the  heart  and  the  blood 
circulation  system  using  either  illustrated  or  unillustrated  materials. 
Immediate  and  delayed  outcome  measures  included  items  assessing  verbal, 
pictorial,  and  spatial  criteria.  Aptitude  measures  represented  Gc  and 
Gv.  There  were  no  substantial  ATI  for  the  verbal  or  spatial  criteria. 
Although  the  pictorial  treatment  was  best  for  everyone,  an  ordinal  inter¬ 
action  indicated  that  this  treatment  was  least  advantageous  for  students 
high  on  Gc  and  low  on  Gv. 

More  recent  research  has  not  changed  the  picture  appreciably.  James 
and  Knief  (1973),  for  example,  taught  students  to  determine  the  number  of 
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subsets  in  a  set  of  elements  using  a  treatment  designed  to  capitalize 
on  either  Gf  or  Gc.  A  sum  of  Gc  and  Gf  scores  represented  general 
ability,  and  the  difference  between  the  two  scores  represented  the  dif¬ 
ferential  hypothesis.  A  pretest  and  posttest  were  administered.  There 
were  significant  main  effects  and  ATI  with  the  sum,  but  neither  the 
difference  score  nor  any  of  its  interactions  were  significant.  Although 
high  ability  students,  on  average,  outperformed  low  ability  students, 
the  treatment  designed  to  capitalize  on  Gc  reduced  their  advantage. 

The  consistent  results  for  the  general-ability  hypothesis,  and  the 
inconsistent  results  for  the  differential  ability  hypothesis,  are  both 
understandable  in  hindsight.  The  wide  variety  of  instructional  treat¬ 
ment  contrasts  that  yield  ATI  with  general  ability  can  be  summarized  in 
terms  of  variation  in  amount  of  information  processing  demand,  but  this 
is  only  a  crude  summary,  at  best.  The  demand  characteristics  of  differ¬ 
ent  kinds  of  instruction  are  not  understood  in  detail,  nor  is  a  process 
theory  of  ability  for  analyzing  task  demands  in  relation  to  individual 
differences  available.  Treatments  are  usually  poorly  specified,  and 
this  hampers  our  pursuit  of  both  the  general  and  the  differential-ability 
hypothesis.  An  enormous  range  of  instructional  materials  have  been 
labeled  ’’spatial"  or  "verbal"  with  little  thought  about  their  processing 
demands.  The  presence  of  figures  or  pictures  does  not  indicate  that  a 
treatment  requires  spatial  ability.  Diagrams  can  tax  ability  but  they 
can  also  compensate  for  weakness.  Similarly,  it  is  insufficient  to 
attach  global  labels  to  categories  of  ability.  A  "spatial"  ability  test 
does  not  necessarily  measure  spatial  ability  (Lohman,  1979a;  1979b). 
Aptitude  measures  should  be  understood  in  terms  of  amount  and  kind  of 
processing  demand. 
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There  is  not  likely  to  be  a  simple  match  of  aptitudes  and  treatments 


(Cronbach  and  Snow,  1977;  Salomon,  1972).  Some  kinds  of  instruction 
build  upon  the  learner’s  capacities  or  preferences,  requiring  students 
to  bring  possessed  abilities  to  bear  in  learning.  Alternatively,  in¬ 
structional  materials  may  do  for  learners  what  they  cannot  do  for  them¬ 
selves,  and  so  may  reduce  ability-outcome  correlations.  Less  able 
students  might  profit  from  such  assistance,  whereas  able  students  might 
be  turned  away  by  it.  Further,  learners  may  substitute  abilities  they 
possess  for  those  they  lack.  Thus,  graphic  problems  might  be  solved  by 
either  verbal  processing  strategies  or  by  direct  manipulation  of  lines 
and  curves. 

Thus,  the  inconsistency  and  complexity  of  earlier  ATI  results  seem 
due,  in  part,  to  the  failure  to  specify  requisite  abilities  for  carefully 
delineated  treatments,  or  to  provide  a  common  process  description  for 
aptitude  and  learning  tasks.  There  has  also  been  inadequate  considera¬ 
tion  given  to  the  multiple  ways  in  which  aptitudes  and  treatments  might 
be  matched.  The  notion  that  students  of  high  spatial  ability  necessarily 
do  better  in  spatial  treatments  ignores  the  complexities  of  both  ability 
and  instructional  material. 

The  Present  Study 

The  primary  hypothesis  investigated  here  related  general  ability  to 
learning.  First,  students  higher  in  general  ability  were  expected  to 
obtain  higher  posttest  scores  on  the  average  than  lower  ability  students. 
Further,  both  verbal  and  figural  supplements  were  expected  to  reduce  the 
slope  of  the  regression  of  outcome  on  general  ability.  The  effect  of 
verbal  supplements  was  expected  to  be  greatest  on  verbal  outcome  measures 
the  effect  of  figural  supplements  was  expected  to  be  greatest  on  figural 


outcome  measures. 
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possess  for  those  they  lack.  Thus,  graphic  problems  might  be  solved  by 
either  verbal  processing  strategies  or  by  direct  manipulation  of  lines 
and  curves. 

Thus,  the  inconsistency  and  complexity  of  earlier  ATI  results  seem 
due,  in  part,  to  the  failure  to  specify  requisite  abilities  for  carefully 
delineated  treatments,  or  to  provide  a  common  process  description  for 
aptitude  and  learning  tasks.  There  has  also  been  inadequate  considera¬ 
tion  given  to  the  multiple  ways  in  which  aptitudes  and  treatments  might 
be  matched.  The  notion  that  students  of  high  spatial  ability  necessarily 
do  better  in  spatial  treatments  ignores  the  complexities  of  both  ability 
and  instructional  material. 

The  Present  Study 

The  primary  hypothesis  investigated  here  related  general  ability  to 
learning.  First,  students  higher  in  general  ability  were  expected  to 
obtain  higher  posttest  scores  on  the  average  than  lower  ability  students. 
Further,  both  verbal  and  figural  supplements  were  expected  to  reduce  the 
slope  of  the  regression  of  outcome  on  general  ability.  The  effect  of 
verbal  supplements  was  expected  to  be  greatest  on  verbal  outcome  measures 
the  effect  of  figural  supplements  was  expected  to  be  greatest  on  figural 


outcome  measures. 
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The  present  study  also  explored  the  differential  impact  of  Gc  and 
Gfv  on  learning.  It  was  hypothesized  that  students  high  in  either  Gc 
or  Gfv  would  learn  more  than  lower  ability  students.  It  was  further 
hypothesized  that  Gc  and  Gfv  would  moderate  the  relations  between  instruc¬ 
tion  and  outcome  differently.  Verbal  supplements  were  expected  to  be 
particularly  useful  to  students  low  in  Gc;  figural  supplements  were 
expected  to  be  particularly  useful  to  students  low  in  Gfv. 

Finally,  the  study  was  planned  to  examine  long-term  as  well  as 
inmediate  learning.  A  reduction  in  average  scores  from  immediate  to 
delayed  posttest  was  expected.  The  greatest  drop  in  performance  was 
expected  when  instructional  supplements  were  used.  By  reducing  process¬ 
ing  demands,  supplements  might  enhance  short-term  learning  while  reducing 
long-term  learning.  This  effect  would  be  particularly  evident  where 
instructional  content  and  outcome  were  matched.  That  is,  losses  on 
verbal  outcome  measures  would  be  greater  when  verbal  supplements  were 
used  than  when  figural  supplements  were  used;  losses  on  figural  outcome 
measures  would  be  greater  with  figural  supplements. 

To  summarize,  this  study  assessed  the  relations  among  aptitudes, 
instructional  supplements,  and  learning  outcomes.  Gc  and  Gfv  were  the 
aptitudes  of  particular  interest,  although  more  specific  aptitudes  were 
also  included.  Instructional  materials  differed  in  the  use  of  verbal 
and  figural  supplements.  Outcome  measures  distinguished  verbal  from 
figural  responses.  In  general,  this  study  was  intended  to  illuminate 
the  relations  between  aptitude  and  instructional  treatment. 
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CHAPTER  2 


METHOD 

A  2-week  course  in  Economics  was  presented  to  high  school  students 
using  one  of  three  sets  of  instructional  materials.  Beforehand,  partici¬ 
pants  completed  a  3-hour  aptitude  test  battery  and  were  randomly  assigned 
to  treatment.  One  posttest  was  administered  at  the  end  of  the  course; 
another  was  given  two  weeks  later. 

Sample 

Participants  were  recruited  from  three  Palo  Alto,  California,  high 
schools.  Tenth-  and  eleventh-grade  students  responded  to  an  advertise¬ 
ment  in  a  local  newspaper  and  were  paid  an  hourly  fee  for  their  partici¬ 
pation.  The  initial  3-hour  aptitude  session  included  a  10-item  screening 
test  to  eliminate  those  already  familiar  with  the  instructional  content. 
Of  the  146  students  who  initially  responded,  132  were  retained  and 
completed  the  experiment.  The  final  sample  included  86  females  and  46 
males;  44  participants  were  assigned  to  each  of  the  three  conditions. 

Treatments 

The  basic  instruction  covered  the  theory  of  supply  and  demand, 
determination  of  market  price,  elasticity  of  supply  and  demand,  and  the 
application  of  these  principles  to  price  floors  and  ceilings,  taxation, 
and  agricultural  problems.  Materials  were  adapted  from  introductory 
college  economics  textbooks  (Lipsey  and  Steiner,  1969;  Samuelson,  1976; 
Spencer,  1977;  Sutton,  1976),  but  presented  at  a  level  appropriate  for 
high  school. 

The  same  material  was  covered  in  each  treatment  condition.  Treat¬ 
ments  varied,  however,  in  the  explanatory  displays  and  the  difficulty  of 
the  processing  demands.  The  three  instructional  conditions  were  Minimal 
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(MIN),  Verbal  Elaboration  (VE) ,  and  Figural  Elaboration  (FE) . 


Information  in  MIN  was  presented  with  little  redundancy,  few 
examples,  and  limited  explana tions.  Participants  were  encouraged  to 
solve  problems  on  their  own  and  to  generate  their  own  explanations  for 
facts  and  principles.  Principles  were  presented  with  limited  verbal 
explanations  and  figural  displays. 

VE  c  ered  the  same  basic  information  as  MIN,  but  with  additional 
verbal  material.  Examples  were  given,  verbal  explanations  were  presented, 
and  basic  concepts  were  redefined  as  learners  encountered  new  material. 
Figural  content  was  identical  to  that  of  MIN. 

FE  also  covered  the  same  basic  MIN  material,  but  with  additional 
graphs  and  diagrams.  Examples  and  exercises  using  graphs  in  problem 
solving  were  added.  Additional  verbiage  was  used  only  to  help  students 

understand  and  manipulate  diagrams.  The  differences  among  treatments  are 
summarized  in  Table  1. 

Each  treatment  consisted  of  eight  50-rainute  instructional  sessions. 
Participants  were  limited  to  one  instructional  session  per  day. 

Materials 

Aptitude  Measures 

Four  tests  were  selected  to  measure  Gfv:  The  Advanced  Progressive 
Matrices  Test  (Raven,  1962),  Paper  Folding  Test  (French,  Ekstrom,  and 
Price,  1963),  Copying  Test  (French  et  al.,  1963),  and  Memory  for  Designs 
(Graham  and  Kendall,  1960).  Measures  of  Gc  included  the  Terman  Concept 
Mastery  Test  (Terman,  1956),  Advanced  Vocabulary  Test  V-4  (French  et  al. , 
1963),  and  a  fill-in  vocabulary  test  adapted  from  the  Wechsler  Adult 
Intelligence  Scale  (Wechsler,  1955).  The  latter  consisted  of  20  words 
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TABLE  1 


Summary  of  Treatment  Specifications 


Component  of  Instruction 


General  information 


Basic  statement  of  economic  principles 
Verbal 
Figural 

Explanation  of  principles 
Verbal 
Figural 

Examples 

Practice  problems 

Solutions  to  practice  problems 

Explanations  of  solutions  for  practice  problems 
Verbal 
Figural 

Redundancy 

Verbal 

Figural 

Underlining 


from  the  WAIS  vocabulary  section,  representing  the  full  range  of  item 
difficulty.  Items  were  scored  using  the  guidelines  presented  in  the 
test  manual. 

A  test  of  graph  processing  (GRAPH)  was  designed  and  administered  to 
supplement  the  broader  ability  tests.  This  test  measured  the  ability  to 


the  end  of  the  session  to  review  it,  thereby  equalizing  students’  working 


times. 

Outcome  Measures 

An  immediate  and  a  delayed  posttest  were  administered  after  the  comple¬ 
tion  of  each  instructional  unit.  The  two  posttests  were  similar  in  format 
and  content.  Each  test  ^orxsisted  of  40  items  and  covered  most  of  the  material 
presented  during  the  ir structional  period.  On  each  test,  15  items  re¬ 
quired  students  to  answer  verbal  questions,  15  required  students  to  deal 
with  ftgural  information,  and  10  questions  required  both  verbal  and 
figural  explanations,  for  these  10  items,  the  student  was  asked  to  indi¬ 
cate  the  explanation  given  first.  Within  each  of  these  categories,  items 
required  either  the  application  of  principles  to  solve  a  problem,  or 
simple  recall  or  recognition  of  information  specifically  discussed  during 
instruction.  Response  formats  included  multiple-choice,  fill  in,  and 
short  problems.  Approximately  one  hour  was  alloted  to  complete  each 
posttest.  Table  2  lists  the  parts  of  each  posttest,  including  item  type 
(figural  vs.  verbal),  response  format,  and  the  number  of  items. 

Procedure 

Instruction  began  approximately  one  week  after  aptitude  testing. 
Participants  attended  one  50  minute  session  fcr  each  of  four  days  during 
the  first  week  of  instruction,  and  for  each  of  five  days  during  the 
second  week  of  instruction.  Students  completed  one  packet  of  material 
during  each  session.  Sessions  were  held  hourly  between  3  p.m.  and  11  p.m. 
on  weekdays,  and  between  9  a.m.  and  5  p.m.  during  the  weekend.  Students 
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Summary  of  Outcome  Measures 
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Numbers  of  verbal  first  and  figural  first  apply  only  to  correct  explanations  on  the  problems. 


were  allowed  to  schedule  themselves  freely  except  that  no  more  than  30 
students  could  be  accomodated  in  a  single  session. 

Upon  arrival,  students  would  take  a  folder  bearing  their  name,  and 
remove  the  appropriate  packet.  At  the  end  of  the  session,  students 
returned  the  packet  to  the  investigator,  who  checked  that  the  student 
had  worked  on  the  proper  materials.  It  was  not  necessary  to  schedule 
participants  in  the  same  treatment  for  the  same  hours  as  all  students 
worked  individually. 

The  immediate  posttest  was  administered  to  each  student  after  all 
instructional  materials  were  completed.  All  participants  took  the  immediate 
posttest  on  the  Friday  of  the  second  week  of  instruction.  Most  partici¬ 
pants  returned  for  the  delayed  posttest  two  weeks  later;  those  who  could 
not  were  scheduled  individually  for  their  delayed  posttest.  All  partici¬ 
pants  completed  the  delayed  posttest  within  11  to  17  days  after  the 
immediate  posttest. 
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Summary  of  Outcome  Measures 
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Numbers  of  verbal  first  and  figural  first  apply  only  to  correct  explanations  on  the  problems. 


were  allowed  to  schedule  themselves  freely  except  that  no  more  than  30 
students  could  be  accomodated  in  a  single  session. 

Upon  arrival,  students  would  take  a  folder  bearing  their  name,  and 
remove  the  appropriate  packet.  At  the  end  of  the  session,  students 
returned  the  packet  to  the  investigator,  who  checked  that  the  student 
had  worked  on  the  proper  materials.  It  was  not  necessary  to  schedule 
participants  in  the  same  treatment  for  the  same  hours  as  all  students 
worked  individually. 

The  immediate  posttest  was  administered  to  each  student  after  all 
instructional  materials  were  completed.  All  participants  took  the  immediate 
posttest  on  the  Friday  of  the  second  week  of  instruction.  Most  partici¬ 
pants  returned  for  the  delayed  posttest  two  weeks  later;  those  who  could 
not  were  scheduled  individually  for  their  delayed  posttest.  All  partici¬ 
pants  completed  the  delayed  posttest  within  11  to  17  days  after  the 
immediate  posttest. 


14 


CHAPTER  3 


RESULTS 

Descriptive  Statistics 

Aptitude  Measures 

Means,  standard  deviations,  reliabilities,  and  correlations  among 
aptitude  measures  for  the  entire  sample  are  presented  in  Table  3. 

Similar  tables  for  the  separate  samples  in  MIN,  VE,  and  FE  are  presented 
in  Appendix  A. 

Scores  from  the  two  vocabulary  tests  and  the  Terman  Concept  Mastery 
Test  were  standardized  in  the  total  sample  and  combined  to  form  a  com¬ 
posite  labeled  Gc.  The  Gfv  composite  included  standardized  scores  for 
Copying,  Memory  for  Designs,  Paper  Folding,  and  the  Advanced  Progressive 
Matrices  Test.  Although  GRAPH  showed  high  correlations  with  both  com¬ 
posites  it  was  left  as  a  separate  third  aptitude  since  it  was  thought  to 
be  specifically  relevant  to  this  instructional  setting.  About  42%  of 
its  variance  was  estimated  to  be  specific. 

Tests  included  in  the  Gc  composite  showed  nigher  correlations  with 
each  other  than  with  measures  of  Gfv.  Copying  and  Memory  for  Designs 
had  higher  correlations  with  other  measures  of  Gfv  than  with  indicators 
of  Gc.  As  expected,  however,  Paper  Folding  and  the  Advanced  Progressive 
Matrices  Test,  both  complex  measures  of  Gfv,  showed  higher  correlations 
with  Gc  than  did  Memory  for  Designs  and  Copying. 

The  Gc  and  Gfv  composites  were  combined  to  form  two  orthogonal 
indices  to  investigate  their  combined  and  differential  importance:  SUM, 
the  3um  of  the  Gc  and  Gfv  composites  represented  general  ability,  and 
DIFF,  Gc  minus  Gfv  represented  the  ability  profile  difference.  Positive 
values  on  DIFF  thus  indicate  students  higher  in  Gc  than  Gfv,  and  negative 
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values  indicate  an  advantage  in  Gfv.  This  procedure  has  advantages  over 
using  Gc  and  Gfv  directly  in  the  analysis  (Cronbach  and  Snow,  1977).  First 
hypotheses  can  be  ordered  so  that  the  general-ability  hypothesis  can  be 
tested  independently  from  the  more  exploratory  differential-ability 
hypothesis.  Second,  SUM  and  DIFF  are  uncorrelated,  whereas,  Gc  and  Gfv 
usually  correlate  (r  =  .54  in  this  study).  Thus,  using  SUM  and  DIFF 
provides  less  ambiguous  interpretations  than  using  Gc  and  Gfv  directly. 

Although  students  were  randomly  assigned,  the  equivalence  of  groups 
was  checked.  There  were  no  significant  mean  or  variance  differences 
among  the  treatment  conditions  on  any  of  the  aptitude  measures  and  only 
minor  differences  in  the  pattern  of  correlations  in  the  three  treatments. 
Thus,  no  systematic  aptitude  differences  among  the  three  groups  could  be 
identified. 

Outcome  Measures 

Reliabilities  of  the  immediate  and  delayed  posttest  were  estimated 
at  .82  and  .84,  respectively,  using  coefficient  alpha.  Correlations 
between  total  scores  and  major  part  scores  on  the  immediate  and  delayed 
posttests  are  presented  in  Appendix  B. 

Means  and  standard  deviations  for  all  parts  of  both  posttests  are 
reported  in  Table  4.  While  treatment  differences  on  the  immediate 
posttest  were  not  always  large,  group  averages  in  VE  and  FE  were  consist¬ 
ently  higher  on  items  corresponding  to  the  type  of  assistance  the  group 
had  received.  On  the  average,  students  in  VE  and  FE  also  obtained  higher 
scores  on  the  problems  than  students  in  MIN.  Thus,  the  mean  total  post¬ 
test  score  was  higher  for  VE  and  FE  than  for  MIN. 

This  pattern  was  not  found  in  the  delayed  posttest  means,  also 
reported  in  Table  4.  Here,  MIN  showed  a  slight  overall  advantage  over 
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Means  and  Standard  Deviations  on  Posttest 
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VE  and  FE  on  total  posttest  score.  Thus,  although  elaborating  and 
simplifying  instruction  (as  in  VE  and  FE)  apparently  produced  immediate 
gains,  information  acquired  through  such  instruction  was  not  retained  as 
well  as  it  was  in  MIN.  MIN  actually  showed  a  small  average  gain  from 
immediate  to  delayed  tests,  while  VE  and  FE  showed  losses.  The  greatest 
losses  occurred  when  treatment  and  posttest  item  type  were  alike.  That 
is,  the  greatest  losses  on  verbal  items  appeared  in  VE;  the  greatest 
losses  on  figural  items  appeared  in  FE. 

On  the  immediate  posttest,  there  were  small  differences  among  groups 
in  the  type  of  explanation  given  on  the  problems.  On  average,  students 
in  VE  gave  correct  verbal  explanations  more  frequently  than  did  students 
in  either  MIN  or  FE.  Verbal  explanations  were  also  more  likely  to  be 
given  before  figural  explanations  by  students  in  VE.  Similarly,  in  FE, 
correct  figural  explanations  to  the  problems  were,  on  average,  given 
before  verbal  explanations.  Similar  trends  were  found  on  the  delayed 
posttest. 

Correlations  Between  Aptitude  and  Outcome 

Table  5  gives  selected  correlations  between  outcome  and  aptitude 
measures.  (See  Appendix  C  for  complete  correlation  matrices.) 

Significant  correlations  between  SUM  and  all  outcome  measures 
appeared  in  all  treatments,  while  correlations  for  DIFF  never  differed 
from  zero.  The  correlations  between  total  posttests  and  SUM  were  similar 
in  the  three  treatments,  but  part  scores  on  the  immediate  posttest 
showed  some  variation  across  treatments,  particularly  when  item  type  and 
treatment  matched.  So,  for  example,  SUM  and  the  immediate  verbal  total 
correlated  .50  in  VE,  but  .74  and  .72  in  MIN  and  FE,  respectively. 
Similarly,  SUM  and  the  immediate  figural  total  correlated  .50  in  FE,  but 


.68  and  .69  in  MIN  and  VE.  On  the  problems  on  the  immediate  post test, 
correlations  with  SUM  were  virtually  identical  in  VE  and  FE,  somewhat 
smaller  than  in  MIN.  On  the  delayed  post test,  the  correlation  between 
SUM  and  the  verbal  total  was  largest  in  VE. 

Correlations  between  GRAPH  and  the  immediate  outcome  were  generally 
lower  than  those  between  SUM  and  outcome.  There  were,  however,  also 
lower  correlations  between  GRAPH  and  immediate  outcome  when  item  type  and 
treatment  were  matched.  On  the  delayed  posttest,  there  were  only  small 
differences  among  the  treatments  in  the  correlations  of  GRAPH  with  total 
posttest  score  and  the  problems.  The  correlation  between  the  verbal 
total  and  GRAPH  was  greatest  in  FE.  The  figural  total  and  GRAPH  had  the 
largest  correlation  in  MIN. 

Regression  Analyses  on  Outcome 

Cronbach  and  Snow  (1977)  recommended  generalized  regression  analysis 
for  investigating  ATI.  The  model  for  the  present  study  took  the  form: 

Y-  b0  +  sss  +  eBD  +  sG  o  +  eT  i  +  sST  st  + 

3dt  dt  +  6gt  gt 

where : 

Y  =  dependent  variable 
£q  =  constant  term 

8  ,  6  »  6  *  regression  coefficients  for  SUM,  DIFF,  GRAPH 

S  D  G 

S,  D,  G  =  score  on  SUM,  DIFF,  GRAPH 

8^  a  regression  coefficient  for  treatment 

T  *  orthogonal  treatment  contrast 

8ST’  ^DT*  ^GT  =  regression  coefficients  for  first-order  ATI 

Two  orthogonal  contrasts  represented  the  three  treatments.  The 


first  contrast  (Tl)  compared  MIN  with  VE  and  FE  (coded  2,  -1,  -1, 
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respectively).  The  second  contrast  (12)  compared  VE  with  FE  (coded  1, 
-1,  respectively,  with  MIN  coded  0).  ATI  terms  were  computed  by  multi¬ 
plying  treatment  contrasts  with  aptitude. 

Separate  analyses  were  conducted  for  total  and  part  scores  on 
immediate  and  delayed  posttest.  A  step-up  procedure  was  used,  with 
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-1,  respectively,  with  MIN  coded  0).  ATI  terms  were  computed  by  multi¬ 
plying  treatment  contrasts  with  aptitude. 

Separate  analyses  were  conducted  for  total  and  part  scores  on 
immediate  and  delayed  posttest.  A  step-up  procedure  was  used,  with 
variables  forced  into  the  equation  in  a  specified  order.  Aptitude  main 
effects  were  entered  before  treatment  effects,  with  SUM  entered  first  to 
test  the  general-ability  hypothesis.  DIFF  was  entered  next  to  test  the 
differential-ability  hypothesis.  GRAPH  was  then  entered  to  assess  its 
specific  contribution,  independent  of  the  proportion  of  its  effects  that 
were  associated  with  general  and  differential  ability.  Treatment  main 
effects  were  then  entered  using  the  two  orthogonal  contrasts.  Following 
all  main  effects,  first-order  ATI  were  entered.  This  order  was  used  for 
all  dependent  variables. 

Total  Score  Analyses 

The  results  of  the  regression  analyses  for  the  immediate  and  delayed 

total  posttests  are  presented  in  Table  6.  The  percentage  of  variance 

accounted  for  reflects  the  change  in  the  squared  multiple  correlation 

coefficient  as  each  predictor  entered  the  equation.  The  F-ratio  tested 

2 

whether  this  change  in  R^  was  significant  using  the  formula: 

F,  -r2/ki 

1-R2  /N-K  -1 

where : 

2 

1R  =  increment  in  R2 

=  number  of  predictors  in  change 
2  2 

R  *  R  for  full  model 
N  *  total  sample  size 
K  *  number  of  predictors  in  total  R^ 
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Aptitude  main  effects,  specifically  SUM  and  GRAPH,  accounted  for 
the  largest  proportion  of  explained  variance  in  both  immediate  and 
delayed  posttests.  High  ability  students  thus  routinely  achieved  higher 
outcome  scores  than  did  low  ability  students,  DIFF  accounted  for  only 
,4%  of  the  variance  in  the  immediate  posttest  and  only  .1%  of  the  variance 
in  the  delayed  posttest. 

Treatment  main  effects  accounted  for  4.6%  of  the  variance  in  the 
immediate  posttest  and  2.7%  of  the  variance  in  the  delayed  posttest. 

These  effects  correspond  to  the  mean  differences  identified  in  Table  4. 

VE  and  FE  showed  higher  immediate  outcomes  than  did  MIN;  differences 
between  VE  and  FE  were  not  significant.  In  contrast,  on  the  delayed 
posttest,  MIN  showed  a  higher  average  than  did  VE  and  FE. 

Interactions  were  tested  in  the  generalized  model.  Then,  unstand¬ 
ardized  regression  coefficients  within  each  treatment  were  plotted 
graphically  for  interpretation.  The  SUM  X  treatment  interaction  was 
statistically  significant  on  the  immediate  posttest,  but  not  on  the 
delayed  post test.  This  interaction  accounted  for  2.9%  of  the  immediate 
posttest  variance;  it  is  shown  graphically  in  Figure  1.  The  relative 
advantage  of  high  ability  students  was  most  pronounced  in  MIN.  In  other 
words,  VE  and  FE  appeared  particularly  helpful  for  low  ability  students, 
reducing  the  difference  between  them  and  high  ability  students.  There 
were  smaller  treatment  differences  at  the  mean  and  reduced  ATIs  on  the 
delayed  posttest. 

Part  Score  Analyses 

Verbal  items.  The  results  of  the  generalized  regression  analyses 
of  verbal  outcome  measures  appear  in  Table  7.  Again,  SUM  accounted  for 
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Summary  of  Stepwise  Regression  of  Verbal  Items 
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the  largest  proportion  of  explained  variance  in  each  posttest,  GRAPH 
accounted  for  a  significant  proportion  of  variance  in  the  immediate 
posttest,  accounting  for  4.6%  of  the  variance  in  it.  Students  with 
higher  aptitude  scores  had  higher  outcome  scores.  DIFF  was  never  signi¬ 
ficant. 

Treatment  main  effects  correspond  to  the  mean  differences  shown 
previously.  On  the  immediate  posttest,  treatment  main  effects  accounted 
for  10.0%  of  the  variance;  VE  was  superior  to  MIN  and  FE.  Treatment  main 
effects  accounted  for  2.9%  of  the  variance  in  the  delayed  posttest,  with 
MIN  superior  to  VE  and  FE. 

First-order  ATI  accounted  for  5.8%  of  the  variance  in  the  immediate 
post  test;  ATI  with  SUM  accounted  for  most  of  this  variance.  Figure  2 
shows  that  VE  was  particularly  helpful  for  low  ability  students. 

The  only  significant  ATI  on  the  delayed  post test  was  between  GRAPH 
and  treatment.  As  shown  in  Figure  3,  only  in  FE  did  students  with  high  GRAPH 
scores  outperform  students  with  low  GRAPH  scores. 

Figural  Items.  Table  8  presents  the  results  of  the  generalized 
regression  analyses  for  figural  items.  As  before,  SUM  and  GRAPH  accounted 
for  a  significant  proportion  of  the  variance  on  both  the  immediate  and 
delayed  posttests.  DIFF  was  not  significant,  accounting  for  only  .5%  of 
the  immediate  posttest  variance  and  ,0%  of  the  delayed  posttest  variance. 

Treatment  main  effects  accounted  for  6.6%  of  the  variance  on  the 
immediate  posttest  and  6.0%  of  the  variance  on  the  delayed  posttest. 

Thus,  on  the  immediate  posttest,  VE  and  FE  were  superior  to  MIN,  on 
average.  FE  had  a  slight  advantage  over  VE.  In  contrast,  on  the  delayed 
posttest,  both  MIN  and  VF  were  superior  to  FE. 
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Figure  3.  Relation  of  Verbal  It 
Unstandardized  Regres 
Shown  in  Parentheses. 
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The  SUM  X  treatment  interaction  was,  again,  significant  on  the 
immediate  and  not  on  the  delayed  posttest.  In  Figure  4,  the  immediate 
regression  sl^pe  on  SUM  was  shallower  in  FE  than  in  MIN  and  VE.  FE  was 
particularly  helpful  to  low  ability  students  in  reducing  differences 
between  them  and  high  ability  students.  The  mean  disadvantage  on  the 
delayed  posttest  for  students  in  FE  can  also  be  seen. 

Interactions  with  GRAPH  accounted  for  1.2%  of  the  variance  in  the 
immediate  posttest,  and  1.4%  of  the  variance  in  the  delayed  posttest. 

Uhile  not  significant  in  the  immediate  posttest,  the  interaction  suggested 
that  FE  reduced  the  advantage  of  students  with  high  GRAPH  scores.  On 
the  delayed  posttest,  both  VE  and  FE  reduced  the  advantage  of  students 
with  high  GRAPH  scores.  These  relations  are  shown  in  Figure  5. 

Problems.  As  with  all  other  dependent  variables,  SUM  accounted  for 
the  largest  proportion  of  explained  variance  in  the  problems  on  both  the 
immediate  and  delayed  posttests  (see  Table  9).  Although  GRAPH  was  not 
a  statistically  significant  predictor  of  the  immediate  posttest,  it 
accounted  for  1.3%  of  its  variance,  and  did  account  for  a  significant 
proportion  of  variance  (4.9%)  on  the  delayed  posttest.  Again,  the  effects 
of  DIFF  were  small.  No  treatment  main  effects  were  significant,  and  tne 
only  significant  ATI  was,  again,  with  SUM  on  the  immediate  posttest.  As 
shown  in  Figure  6,  differences  between  high  and  low  ability  students  were 
greatest  in  MIN. 

Summary  of  Regression  Analyses 

SUM  accounted  for  the  vast  majority  of  variance  in  all  dependent 
measures,  and  GRAPH  accounted  for  significant  proportions  of  variance  in 
most.  Again,  because  GRAPH  and  SUM  were  correlated,  and  because  SUM  was 
entered  into  the  regression  analyses  before  GRAPH,  effects  associated 
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Summary  of  Stepwise  Regression  of  Figural  Items 
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Relation  of  Figural  Items  to  GRAPH,  with 
Unstandardized  Regression  Coefficients 
Shown  in  Parentheses. 
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with  GRAPH  reflect  its  specific  variance,  not  variance  shared  with  SUM. 

The  differential-ability  hypothesis,  tested  by  DIFF  was  never  significant. 

Treatment  main  effects  were  significant  for  the  total,  verbal,  and 
figural  scores  on  both  posttests.  The  immediate  posttest  showed  a  gen¬ 
eral  advantage  for  VE  and  FE  over  MIN.  VE  was  most  advantageous  on 
verbal  items,  FE  on  figural  items.  But  this  trend  was  reversed  on  the 
delayed  posttest  where  students  in  MIN  outperformed  those  in  VE  and  FE. 

In  fact,  the  lowest  mean  figural  part  score  occurred  in  FE. 

All  significant  ATI  on  the  immediate  posttest  involved  general 
ability  (SUM) ;  VE  and  FE  reduced  the  advantage  of  high  ability  students. 
This  advantage  was  reduced  most  in  VE  on  verbal  items  and  in  FE  on  figural 
items. 

Similar  relations  were  not  found  on  the  delayed  posttest.  High 
ability  students  continued  to  outperform  low  ability  students  on  the 
delayed  posttest;  the  relative  advantages  for  students  in  VE  and  FE  were 
not  retained  over  time.  Significant  ATI  were  obtained  only  with  GRAPH. 

Thus,  in  this  study,  instruction  that  was  most  effective  for  im¬ 
mediate  learning  was  not  most  effective  in  the  long  run.  ATI  effects 
suggested  that  this  shift  may  have  come  primarily  from  low  ability  students 
who  did  not  retain  the  additional  information  that  enhanced  immediate 
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CHAPTER  4 


SUMMARY  AND  CONCLUSIONS 

The  present  study  examined  the  effects  of  verbal  and  figural  sup¬ 
plements  on  learning,  and  their  relation  to  general  ability,  Gc  and 
Gfv,  and  a  specific  graph-processing  test.  This  chapter  summarizes 
prior  research  and  the  procedures  and  results  of  the  present  study. 
Implications  for  future  research  and  educational  practice  are  discussed. 

The  Research  Problem 

Although  numerous  studies  have  investigated  interactions  between 
aptitude  and  verbal  and  figural  instructional  supplements,  few  consistent 
relations  have  been  established.  In  some  studies,  the  regression  of 
outcome  on  aptitude  was  steeper  when  aptitude  and  instructional  condi¬ 
tion  were  matched;  in  others,  the  regression  was  shallower.  Some  studies 
obtained  similar  regression  slopes.  These  relations  varied,  in  part, 
as  a  function  of  the  delay  between  instruction  and  the  posttest.  Incon¬ 
sistencies  also  resulted  from  not  specifying  aptitude  and  treatment  in 
sufficient  detail  or  considering  all  the  ways  they  might  be  matched. 

The  Present  Research 

The  present  study  compared  the  effects  of  minimal  instruction, 
instruction  elaborated  with  verbal  supplements,  and  instruction  elaborated 
with  figural  supplements.  Aptitude  was  represented  by  a  Gc  composite, 
a  Gfv  composite,  and  a  graph-processing  test.  The  Gc  and  Gfv  composites 
were  summed  to  indicate  general  ability  and  their  difference  was  used 
to  investigate  their  differential  impact.  Immediate  and  delayed  outcome 
measures  included  verbal  items,  figural  items, arri  problems  that  could  be 
solved  either  verbally  or  figurally.  Learning  was  described  as  a  function 
of  aptitude  and  instructional  material. 
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with  GRAPH  reflect  its  specific  variance,  not  variance  shared  with  SUM. 
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The  Effects  of  General  Ability 

On  the  immediate  posttest,  students  in  VE  and  FE  did  better  than 
students  in  MIN,  suggesting  that  the  elaboration  provided  in  these  con¬ 
ditions  helped  students  learn.  Significant  ATI  suggested  that  the  elabo¬ 
ration  was  particularly  useful  to  low  ability  students.  High  ability 
students  did  as  well  or  better  in  MIN.  Thus,  the  regression  of  a 
ment  on  general  ability  was  steepest  in  MIN  and  reduced  in  VE  and  Ft. 

Partitioning  of  the  total  test  score  by  item  type  indicated  that 
VE  was  particularly  helpful  on  verbal  items  and  FE  was  particularly  helpful 
on  figural  items.  Again,  significant  ATI  indicated  that  these  treatments 
were  particularly  helpful  to  low  ability  students.  Hence,  the  regression 
of  verbal  items  on  general  ability  was  least  steep  in  VE;  the  regression 
of  figural  items  on  general  ability  was  least  steep  in  FE. 

Examination  of  learning  outcomes  on  retention,  however,  led  to 
strikingly  different  conclusions.  While  students  in  MIN  were  worst  on 
average  achievement  on  the  immediate  posttest,  they  performed  best  on 
the  delayed  test. 

VE  and  FE  provided  more  information  to  learners  through  additional 
explanations  and  examples.  MIN  required  students  to  provide  this  infor¬ 
mation  for  themselves,  thereby  demanding  more  active  work  from  learners. 
More  able  students,  capable  of  doing  it,  did  equally  well  in  MIN  as  in 
VE  and  FE,  Less  able  students  that  could  not  generate  that  information 
for  themselves  benefitted  from  the  assistance.  The  gains  were  short-lived, 
however.  Active  mental  work,  necessary  in  MIN,  appeared  to  aid  retention. 
Hence,  there  was  a  greater  decline  in  performance  in  VE  and  FE  than  in 
MIN.  This  contention  was  further  supported  by  the  observation  that  losses 
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were  greatest  when  the  assistance  was  most  direct*  That  is,  losses  on 
verbal  items  were  greatest  in  VE;  losses  on  figural  items  were  greatest 
in  FE. 

The  Differential  Effect  of  Gc  and  Gfv 

The  differential  impact  of  Gc  and  Gfv  as  measured  in  this  study, 
did  not  enter  differently  into  outcome  on  aptitude  relations.  This  may 
be  due,  in  part,  to  a  failure  to  adequately  distinguish  Gc  from  Gfv. 

That  is,  measures  of  Gc  and  Gfv  shared  a  considerable  proportion  of 
variance.  Reducing  that  overlap  might  increase  the  chance  of  detecting 
differences  in  their  impact.  Thus,  future  research  in  this  area  must 
strive  to  do  this. 

The  Effect  of  GRAPH 

The  instructional  materials  in  this  study  used  many  graphic  displays. 
Therefore,  GRAPH  was  included  as  a  specific-ability  measure.  Significant 
main  effects  were  associated  with  GRAPH  at  both  testings,  even  after  account¬ 
ing  for  the  effects  of  general  ability.  As  with  general  ability,  students 
with  higher  aptitude  scores  did  better.  Thus,  learning  outcomes  were  not 
fully  described  by  the  effects  of  general  ability. 

Conclusions  and  Implications 

This  study  provided  evidence  that  neither  aptitude  nor  instructional 
treatment  alone  can  fully  describe  learning  outcomes.  Interactions  between 
them  exist  and  can  be  demonstrated.  Further,  instructional  supplements, 
whether  verbal  or  figural,  can  be  effective  in  filling-in  for  students 
weaknesses  and  reducing  differences  between  high  and  low  ability  students. 

Such  supplements,  however,  must  be  used  with  caution.  Reducing  the 
difficulty  of  instructional  materials  may  indeed  enhance  immediate  learning, 
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The  Effects  of  General  Ability 

On  the  immediate  posttest,  students  in  VE  and  FE  did  better  than 
students  in  MIN,  suggesting  that  the  elaboration  provided  in  these  con¬ 
ditions  helped  students  learn.  Significant  ATI  suggested  that  the  elabo¬ 
ration  was  particularly  useful  to  low  ability  students.  High  ability 
students  did  as  well  or  better  in  MIN.  Thus,  the  regression  of  a 
ment  on  general  ability  was  steepest  in  MIN  and  reduced  in  VE  and  Ft. 

Partitioning  of  the  total  test  score  by  item  type  indicated  that 
VE  was  particularly  helpful  on  verbal  items  and  FE  was  particularly  helpful 
on  figural  items.  Again,  significant  ATI  indicated  that  these  treatments 
were  particularly  helpful  to  low  ability  students.  Hence,  the  regression 
of  verbal  items  on  general  ability  was  least  steep  in  VE;  the  regression 
of  figural  items  on  general  ability  was  least  steep  in  FE. 

Examination  of  learning  outcomes  on  retention,  however,  led  to 
strikingly  different  conclusions.  While  students  in  MIN  were  worst  on 
average  achievement  on  the  immediate  posttest,  they  performed  best  on 
the  delayed  test. 

VE  and  FE  provided  more  information  to  learners  through  additional 
explanations  and  examples.  MIN  required  students  to  provide  this  infor¬ 
mation  for  themselves,  thereby  demanding  more  active  work  from  learners. 
More  able  students,  capable  of  doing  it,  did  equally  well  in  MIN  as  in 
VE  and  FE,  Less  able  students  that  could  not  generate  that  information 
for  themselves  benefitted  from  the  assistance.  The  gains  were  short-lived, 
however.  Active  mental  work,  necessary  in  MIN,  appeared  to  aid  retention. 
Hence,  there  was  a  greater  decline  in  performance  in  VE  and  FE  than  in 
MIN.  This  contention  was  further  supported  by  the  observation  that  losses 
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but  these  advantages  may  be  short-lived.  In  the  present  study,  increasing 
the  difficulty  of  the  work  required  for  initial  learning  appeared  to  in¬ 


crease  retention.  The  benefits  in  immediate  learning  must  be  weighed  with 
the  need  to  ensure  that  information  is  retained. 

The  implications  for  educational  practice  are  clear.  The  present 
study  indicated  that  instruction  that  enhances  immediate  learning  is  not 
necessarily  best  for  retention.  Ultimately,  educators  must  be  concerned 
with  how  much  information  is  retained  and  not  limit  their  concerns  to 
immediate  outcomes.  Thus,  not  only  must  achievement  be  assessed  at  more 
than  one  point  in  time,  curricula  must  be  developed  to  promote  long-term 
learning.  The  current  emphasis  on  testing  after  only  short  delays  should 
be  reconsidered. 

Additional  research  is  necessary  to  confirm  the  findings  of  this 
study  and  support  these  contentions.  ATI  research  is  one  avenue  for  ex¬ 
ploring  this  area,  but  it  should  be  supplemented  with  more  basic  research 
in  information  processing.  Improved  methods  of  distinguishing  Gc  from  Gfv 
are  required  to  explore  their  differential  impact  on  learning.  These 
methods  may  emerge  as  we  gain  a  better  process  understanding  of  these 
abilities  through  further  research. 

Finally,  researchers  should  examine  both  immediate  and  delayed  out¬ 
comes,  and  attempt  to  identify  instructional  conditions  likely  tc  promote 
long-term  retention.  At  a  min  mum,  delayed  outcome  measures  may  be  added 
to  instructional  research  conducted  in  different  contexts.  Additional, 
more  directed  research  may  probe  more  deeply  the  relations  among  instruc¬ 
tional  materials,  immediate  Learning,  and  retention.  The  present  study 
suggests  that  we  cannot  limit  research  to  immediate  outcomes  if  we  are 
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were  greatest  when  the  assistance  was  most  direct*  That  is,  losses  on 
verbal  items  were  greatest  in  VE;  losses  on  figural  items  were  greatest 
in  FE. 

The  Differential  Effect  of  Gc  and  Gfv 

The  differential  impact  of  Gc  and  Gfv  as  measured  in  this  study, 
did  not  enter  differently  into  outcome  on  aptitude  relations.  This  may 
be  due,  in  part,  to  a  failure  to  adequately  distinguish  Gc  from  Gfv. 

That  is,  measures  of  Gc  and  Gfv  shared  a  considerable  proportion  of 
variance.  Reducing  that  overlap  might  increase  the  chance  of  detecting 
differences  in  their  impact.  Thus,  future  research  in  this  area  must 
strive  to  do  this. 

The  Effect  of  GRAPH 

The  instructional  materials  in  this  study  used  many  graphic  displays. 
Therefore,  GRAPH  was  included  as  a  specific-ability  measure.  Significant 
main  effects  were  associated  with  GRAPH  at  both  testings,  even  after  account¬ 
ing  for  the  effects  of  general  ability.  As  with  general  ability,  students 
with  higher  aptitude  scores  did  better.  Thus,  learning  outcomes  were  not 
fully  described  by  the  effects  of  general  ability. 

Conclusions  and  Implications 

This  study  provided  evidence  that  neither  aptitude  nor  instructional 
treatment  alone  can  fully  describe  learning  outcomes.  Interactions  between 
them  exist  and  can  be  demonstrated.  Further,  instructional  supplements, 
whether  verbal  or  figural,  can  be  effective  in  filling-in  for  students 
weaknesses  and  reducing  differences  between  high  and  low  ability  students. 

Such  supplements,  however,  must  be  used  with  caution.  Reducing  the 
difficulty  of  instructional  materials  may  indeed  enhance  immediate  learning, 
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truly  interested  in  the  long-term  impact  of  instruction. 

In  conclusion,  this  study  examined  the  relations  among  aptitude, 
instruction,  and  learning.  While  it  provided  data  to  help  answer  some 
questions  in  this  area,  it  raised  many  others  that  only  future  research 
may  resolve. 
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APPENDIX  A 
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Means,  Standard  Deviations,  and  Correlations 


Note.  Decimals  omitted  from  correlations 
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Immediate  Posttest  Delayed  Posttest 
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Note,  Decimals  omitted  from 


Immediate  Posttest  Delayed  Posttest 
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